1. INTRODUCTION


Finite sets of points can be represented by finite lists of points, but it is impossible to represent finite 
lists of points with finite sets of points. This is because a set of points has no distinguished member and 
therefore it is impossible to define a function which maps a set of points to a "first" point in that set. While 
this simple observation seems straightforward and correct, it is very difficult to prove. What is meant by 
"represent" or by "a distinguished member"? Intuitively a list has more structure or contains more 
information than a set. But what does this mean? One approach to defining the meaning of 
"representations", "distinguished members" and "more structure" is to study the general nature of 
mathematical objects (such as sets and sequences). One approach to the general nature of mathematical 
objects is set theory. Symmetric set theory is a new set theory which provides simple universal definitions of 
the above notions. 

Currently the most widely studied formal theory of mathematics as a whole is Zermelo-Fraenkel set
theory (ZF) and its variants. The primary difference between ZF set theory and symmetric set theory involves
points. A point is an object which has no members (and is thus not a set). In ZF set theory there is only one
point (the null set) while symmetric set theory requires the existence of many points. The following
discussion of this issue is from the introduction to Foundations of Set Theory by Fraenkel, Bar-Hillel, and
Levy [Fraenkel et al. 58] (they refer to points as individuals).

Let us refer to those elements which have members as sets and to those elements which have no 
members as individuals.... 

The existence of at least one individual is called for by both philosophical and practical reasons. 

... Let us, however, stress that referring to one of the individuals as the null set is done only for reasons 
of convenience and simplicity, and can be regarded as a mere notational convention. 

Having decided that we need an individual we now face the question of whether we need more 
than one individual. It turns out that for mathematical purposes there seems to be no real need for 
individuals other than the null set. Therefore we shall not admit any such individuals into ZF. 

While it is true that most mathematics can be done in a framework where the null set is the only
point, there are notions which are best defined in a framework where many points are present. For example
every mathematical object seems to have a natural notion of isomorphism associated with it. There is a
natural notion of what it means for two Turing machines, or context free grammars, or topological spaces to
be isomorphic. ZF set theory provides no satisfactory general notion of isomorphism. The terms
"representation", "distinguished element", and "more structure" are very hard to define in a framework
where only one point is present. However these notions can be given simple universal definitions in the
presence of many points.

The notion of isomorphism can be approached from two different directions. The first is to extend
standard notions of isomorphism for particular types of objects to a notion of isomorphism for arbitrary sets
which are built up out of points. This type of isomorphism will be called a structural isomorphism. The
second approach to the notion of isomorphism is to consider the symmetries (automorphisms) of a universe
<U ∈>. A pair <U ∈> is a universe of objects where U is a universal domain and ∈ is a binary relation on U
where x∈y is read "x is an element of y". A point of <U ∈> is an element p of U such that there is no x in U
such that x∈p. The universe of symmetric sets has lots of points and lots of automorphisms. In particular
there is a natural one to one correspondence between the permutations of the points of <U ∈> and the
automorphisms of <U ∈> (since ZF set theory allows only one point there are no non-trivial symmetries of
<U ∈> in ZF set theory). In the theory of symmetric sets it can be shown that two objects x and y are
structurally isomorphic just in case there is an automorphism (a symmetry) of <U ∈> which maps x to y. This
result justifies the intuition that isomorphic objects are indistinguishable.

Philosophers of mathematics have observed that mathematical descriptions of structures such as the
natural numbers do not determine the identity of those structures [Benacerraf 65]. The best one can hope to
do is to determine identity "up to isomorphism". The theory of symmetric sets reflects this observation. A
specification for a particular object x might be a formula Φ such that Φ(y) holds just in case y is x. However
consider any first order formula Φ of one free variable whose only non-logical symbol is ∈. If x and y are
isomorphic objects then there is a symmetry of <U ∈> which maps x to y and thus Φ(x) holds in <U ∈> just in
case Φ(y) holds in <U ∈>. Thus if there are several different objects which are isomorphic to x then no such
formula Φ can name x. In the theory of symmetric sets there are always many objects which are isomorphic
to x.

The theory of symmetric sets provides a simple and natural measure of the "abstractness" of objects.
A more abstract object is an object with "less structure". A precise definition of this notion can be
approached in three different ways. First, the symmetries of the universe of symmetric sets greatly restrict the
predicates and functions which can be defined in terms of the structure of <U ∈>. A function F will be called
essential if it commutes with any symmetry (automorphism) of <U ∈>, i.e. for any symmetry ρ of <U ∈> and
any element x of U, ρ(F(x)) must equal F(ρ(x)). Given two elements x and y of U there may not exist any
essential function F which maps x to y. For example there is no essential function which maps a set of points
to an element of that set. An object y is called an abstraction of an object x just in case there is an essential
function F which maps x to y. This notion of abstraction can also be approached by studying the symmetries
of particular objects. The symmetry group of an object x, denoted A(x), is the set of all automorphisms of
<U ∈> which leave x fixed. The notion of an abstraction can also be defined by saying that y is an abstraction
of x just in case A(y) contains A(x). A third approach to the notion of abstraction is via the notion of
contextual isomorphism and the general notion of a "canonical" object. Two objects x and y are said to be
isomorphic in the context of z just in case there is a symmetry of z which maps x to y. For example consider a
circle and two points p and q which are in the same plane as the circle. The points p and q are isomorphic in
the context of the circle just in case they are the same distance from the center of the circle. An object y is said
to be canonical in the context of an object z just in case the isomorphism class of y in the context of z, |y|_z, is a
singleton set (there is no canonical point on a circle or canonical corner on a square). It turns out that y is an
abstraction of x just in case y is canonical in the context of x.

Given the above notion of abstractness it is possible to provide a precise notion of "representation".
For example finite sets can be represented as finite sequences because there is an essential function which
maps any sequence to the corresponding set. However it is easy to show that there is no essential function
which maps finite sets of points to finite sequences of points. Thus sequences cannot be represented as sets.

Mathematics is often done in the framework of some fixed but arbitrary context. Intuitively a
context is a collection of objects which are taken to be fixed during the course of a mathematical discussion.
The natural numbers, the real numbers, and the empty set are all usually assumed to be fixed objects even
though their "true identity" cannot be specified. The result relating essential functions, symmetry groups,
and canonical objects can be generalized to account for context.

In addition to the different treatment of points there is another less important distinction between
symmetric set theory and ZF set theory. The axioms of ZF set theory are (an infinite number of) sentences of
first order logic while the axioms of symmetric set theory are precise conditions on the universe <U ∈> which
are stated in English rather than first order logic. Thus symmetric set theory avoids all of the clumsiness of
first order logic. Furthermore it is shown in an appendix that a simple extension of the axioms of symmetric
set theory specifies the structure of <U ∈> up to isomorphism, something which could never be done in first
order logic.
2. THE AXIOMS OF SYMMETRIC SET THEORY


In deciding on axioms for a universe of mathematical objects there are several considerations. First 
the axioms should be as clear, simple, and natural as possible. Second, since ZF set theory is well established 
the axioms should not differ unnecessarily from ZF set theory. Finally, and most interestingly, the axioms 
should provide a basis for defining universal notions of isomorphism, abstraction, and representation. 
However one should not expect to immediately see how the axioms of symmetric set theory provide a basis 
for general notions of isomorphism, abstraction, and representation. These notions can be defined only after 
the consequences of the simple set theoretic axioms have been investigated. 

No proof of the consistency of the axioms of symmetric set theory is presented in this section.
However it is shown in an appendix that the consistency of the axioms is equivalent to the existence of a
strongly inaccessible cardinal. The appendix also shows that any universe satisfying the axioms is determined
up to isomorphism by a "height" and a "width" where the height can be any strongly inaccessible cardinal
and the width can be any cardinal at least as large as the height. Thus if there are strongly inaccessible
cardinals then there are many different (non-isomorphic) universes satisfying the axioms. There is however a
unique (up to isomorphism) minimal universe <U ∈> whose height and width are both the least strongly
inaccessible cardinal. Throughout the following sections however the universe will be taken to be some fixed
but arbitrary model of the axioms.

2.1. The Nature of the Universe <U ∈>

The universe of symmetric sets is taken to be a pair <U ∈> where U is some domain and ∈ is a
binary relation on U. Some elements of U can be thought of as sets in the standard way. For example
consider the pair <U ∈> given as follows:

U is the set {a b c d e f}. The relation ∈ is given by:

a∈d b∈d c∈d
a∈e b∈e
a∈f c∈f

In this situation a, b, and c are points. 

Definition: A point is an element p of U which has no members, i.e. x∉p for all x in U.

In the above example the element d represents the set {a b c}, e represents the set {a b}, and f
represents the set {a c}. Note that the relationship between d and the set {a b c} is given by the relation ∈
and cannot be defined purely in terms of the set U or the element d. Not all subsets of U need have
representations in U. In the above example there is no representation for the set {e f}.

Definition: A subset C of U is represented in <U ∈> just in case there is an element z of U
such that x∈z just in case x is in C; in this case z is called a representation of C.
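The example universe and the two definitions above can be checked mechanically. The following is a minimal Python sketch, not part of the theory itself: the names MEMBER, members, points, and representations are illustrative assumptions, and the relation ∈ is stored as a set of pairs (x, y) meaning x∈y.

```python
# The six-element example universe from the text.
U = {"a", "b", "c", "d", "e", "f"}

# The relation ∈, stored as pairs (x, y) meaning "x is an element of y".
MEMBER = {
    ("a", "d"), ("b", "d"), ("c", "d"),
    ("a", "e"), ("b", "e"),
    ("a", "f"), ("c", "f"),
}

def members(z):
    """All x in U with x ∈ z."""
    return {x for (x, y) in MEMBER if y == z}

def points():
    """Elements of U that have no members."""
    return {p for p in U if not members(p)}

def representations(C):
    """All z in U whose members are exactly the elements of C."""
    return {z for z in U if members(z) == set(C)}

print(sorted(points()))                 # ['a', 'b', 'c']
print(representations({"a", "b", "c"}))  # {'d'}
print(representations({"e", "f"}))       # set()
```

Note that the representation depends on MEMBER and not on U or d alone, which is the observation made in the text.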

It is important to note that the notion of representation expressed in the above definition is not the
same as the notion of representation intended in the title of this paper. A more general notion of
representation will be presented in section four. However in this section the term "representation" will be used
only in the sense given in the above definition.

The axioms of the theory of symmetric sets imply that the universe U is not empty. While a special
axiom to this effect is not needed, the fact that U is not empty will be emphasized with an explicit axiom.

Axiom Zero: U is non-empty.

The first axiom of symmetric set theory is that representations are unique. 

Axiom One, Extensionality: Representations of non-empty sets are unique, i.e. for any
non-empty subset C of U there is at most one element x of U which represents C.

The universe <U ∈> should be thought of as containing representations for tuples, functions, and
relations as well as representations for subsets of U. For example if x and y are elements of U then z will be
called a representation for the pair <x y> just in case z represents the set {x {x y}}, or more precisely z
represents a set {x w} where w represents the set {x y}. Functions and relations are represented by sets of
tuples in the standard way (or more precisely a function is represented by a set of elements of U each of which
represents a tuple). Again the term "representation" is being used here in a different sense from that
intended in the title of the paper and a more general definition is given in section four.

2.2. The Comprehension Axioms 

Given the above axioms it is possible that U contains only points, i.e. that no element of U
represents a non-empty subset of U. Axioms that require that certain non-empty subsets of U be represented
in <U ∈> are called comprehension axioms. The first comprehension axiom makes use of the following
definition:

Definition: A subset C of U will be called small just in case there is some subset C' of U
which is represented in <U ∈> such that the cardinality of C' is at least as big as the cardinality of C.

A subset C of U which is not small will be called large. 
By definition every subset C of U which is represented in <U ∈> is small. However it has not been
guaranteed that every small subset of U is represented in <U ∈>. This is the first comprehension axiom.

Axiom Two, Strong Comprehension: Every small subset of U is represented in <U ∈>.

The above comprehension axiom implies that the cardinality of a subset C of U determines whether
or not C is represented in <U ∈>. If C is small then it is represented in <U ∈>; if it is large then it is not. This
leads to the following lemma:

Lemma 2.1: If two-element sets are small then for any small subsets C and C' of U all
functions from C to C' are represented in <U ∈>.

Proof. An ordered pair <x y> of elements of U is taken to be the set {x {x y}}. Thus if two-element
sets are small any ordered pair of elements of U is represented in <U ∈>. Any function from C to
C' is a set of such pairs with the same cardinality as C and is therefore a small subset of U.

One model of the above comprehension axiom is a universe <U ∈> where U is infinite and a
non-empty subset C of U is small just in case it has fewer than seven members. To rule out such a universe
some further axioms are needed.

Axiom Three, Infinity: There exists an element of U which represents a countably infinite
subset of U.

Axiom Four, Power Set: If a subset C of U is small then any subset of U with the cardinality
of the power set of C is also small.

Axiom Five, Union: A small union of small sets is small, i.e. for any family F of subsets of U
if F is small (has cardinality less than or equal to some small subset of U) and if each set in F
is small then the union of all sets in F is also small.

There is one final comprehension axiom which does not correspond to any axiom of ZF set theory. 
This final axiom will turn out to be important in later sections. 

Definition: P(U) is the set of all points in <U ∈>.

Axiom Six, Point Comprehension: P(U) is large. 

This axiom implies that for any small subset C of U there is a set of points C' which is the same size
as C. Since both C and C' are small lemma 2.1 implies that the bijections (one to one onto functions) from C
to C' are represented in <U ∈>. Thus any small set can be "identified" with a set of points.
2.3. The Foundation Axiom


The Foundation axiom is the final axiom of the theory of symmetric sets. It states that there are no
infinitely decreasing membership chains.

Axiom Seven, Foundation: There is no infinitely decreasing sequence of elements of U, i.e.
there is no infinite sequence x_1 ∋ x_2 ∋ x_3 ∋ ...

The foundation axiom has important implications. It implies that there is no element x of U such
that x∈x (otherwise the infinite sequence all of whose elements were x would be an infinite decreasing
sequence). In fact there can be no containment loops, i.e. no sequence x_1∈x_2∈ ... ∈x_n such that x_n∈x_1. The
foundation axiom is equivalent to the statement that every non-empty subset C of U contains a lower bound
under ∈, i.e. any non-empty subset C of U contains an element x such that there is no y in C such that y∈x.
The foundation axiom can also be characterized in terms of the transitive closure of ∈.

Definition: The binary relation ∈+ is defined to be the transitive closure of ∈, i.e. for any
two elements x and y of U, y∈+x just in case y∈x or there is some finite sequence z_1, z_2, ...,
z_n such that y∈z_1∈z_2∈ ... ∈z_n∈x.

The foundation axiom ensures that the relation ∈+ is a partial order on U and that ∈+ is well
founded. The fact that ∈+ is a well founded partial order on U allows one to define functions on U by
recursion on ∈+. For example it is possible to define a function P which maps every element of U to its
underlying set of points. This function is defined by recursion on ∈+ as follows:

Definition of the function P:

P(q) = {q} for any point q
P(x) = ∪_{y∈x} P(y) for any non-point x

Thus for any element x of U we can talk about the points P(x) of the element x. Note that P(x) is
always a subset of U rather than being any particular element of U (in fact P(x) is always a subset of P(U), the
set of all points). It is not immediately obvious that for any element x of U the set P(x) is small and therefore
represented in <U ∈>. However this does follow from the axioms presented so far.
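The recursive definition of P can be illustrated on hereditarily finite objects. The sketch below is a toy model under stated assumptions: points are modelled as Python strings, non-points as non-empty frozensets, and the name underlying_points is illustrative. The recursion terminates because nesting of frozensets is well founded, which mirrors the role of the foundation axiom.

```python
def underlying_points(x):
    """P(x): the set of points from which the object x is built."""
    if isinstance(x, str):            # a point q: P(q) = {q}
        return frozenset({x})
    result = frozenset()
    for y in x:                       # a non-point: union of P(y) over y ∈ x
        result = result | underlying_points(y)
    return result

# The object {p {q {p r}}} is built from the points p, q, and r.
x = frozenset({"p", frozenset({"q", frozenset({"p", "r"})})})
print(sorted(underlying_points(x)))   # ['p', 'q', 'r']
```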

Theorem 2.2: For any element x of U, the set P(x) is small. 

Proof. The proof is by induction on ∈+. For a point p the set P(p) is just {p} which is clearly
small. Now consider any element x of U which is not a point and such that for every y such that
y∈+x, P(y) is small. P(x) equals the union over y∈x of P(y) and therefore P(x) is a small union of
small sets and must be small. •
3. ISOMORPHISMS

A point is that which has no part. 

- Euclid 


It seems that every precisely defined object has a natural notion of isomorphism associated with it.
For example graphs, context free grammars, Turing machines, and topological spaces all have a natural
associated notion of isomorphism. All of these notions of isomorphism are based on identifications between
the points of one object and the points of another. This observation motivates a notion of structural
isomorphism defined for arbitrary elements of the universe <U ∈> of mathematical objects.

It is intuitively clear that isomorphic mathematical objects are in some sense identical. The strongest
sense in which two elements x and y of U can be identical is if there is some symmetry (automorphism) of
<U ∈> which maps x to y. It turns out that under the aforementioned notion of structural isomorphism two
objects are structurally isomorphic just in case there is a symmetry of <U ∈> which maps one to the other.

The notion of isomorphism can be generalized to take into account an arbitrary but fixed context.
At one level any two points p and q are isomorphic. However if p and q appear in some fixed context then p
and q need not be considered isomorphic. For example p and q are isomorphic in the context of the set
{p q {r s}} but they are not isomorphic in the context of the set {p {q r}}. It turns out that for any element z
of U which is taken as a fixed context there is a natural and general definition for when two elements of U are
isomorphic in the context of z.

The universe <U ∈> is intended to be a model of the universe of all mathematical objects. For this
reason the term "object" will be used as a synonym for the phrase "an element of U".


3.1. The Symmetries of <U ∈>


An automorphism or symmetry of <U ∈> is a one to one onto map ρ from U to U (a permutation of
U) such that for any x and y in U, ρ(x)∈ρ(y) just in case x∈y. It can be shown that for any symmetry ρ of
<U ∈> and any object x, ρ(x) is a point just in case x is a point. Thus for any symmetry ρ of <U ∈> the
restriction ρ|P(U) of ρ to the points P(U) is a one to one onto map from P(U) to P(U), i.e. ρ|P(U) is a
permutation of P(U). The first important theorem concerning the symmetries of <U ∈> is that each symmetry
is determined by its corresponding permutation of P(U).

Theorem 3.1: If ρ and ρ' are two symmetries of <U ∈> such that ρ|P(U) equals ρ'|P(U) then
ρ equals ρ'.

Proof. The proof is by induction on ∈+. By assumption ρ and ρ' are the same function on points.
Consider any x in U such that ρ and ρ' have the same value on all elements y of U such that y∈+x.
Since ρ preserves the membership relation the set represented by ρ(x) equals {ρ(y): y∈x}.
Similarly the set represented by ρ'(x) equals {ρ'(y): y∈x}. But since ρ and ρ' are the same function
on all y∈+x, ρ(x) and ρ'(x) must represent the same set. Thus by extensionality ρ(x) must equal
ρ'(x).

Any symmetry of <U ∈> determines a permutation of P(U) and theorem 3.1 shows that the induced
permutation of P(U) uniquely determines the symmetry. It can also be shown that every permutation of P(U)
corresponds to a symmetry of <U ∈>.

Theorem 3.2: Any permutation ρ of P(U) can be extended to a symmetry of <U ∈>.

Proof. Let ρ be any permutation of P(U). The extension of ρ to all of U is defined by induction on
∈+ via the following relation:

ρ(x) = the representation of {ρ(y): y∈x}

The set {ρ(y): y∈x} is guaranteed to be represented in <U ∈> because it can be no larger than the
set represented by x. It follows from the above equation that if y∈x then ρ(y)∈ρ(x) and further if
ρ(y)∈ρ(x) then y∈x. It remains only to show that the extension of ρ to all of U is one to one and
onto. Consider the inverse permutation ρ^-1 of P(U) and the extension of this inverse to all of U. It
can be shown by a standard induction on ∈+ that ρ^-1(ρ(x)) equals x for all x in U and thus the
extension of ρ is one to one. Similarly it can be shown that ρ(ρ^-1(x)) must equal x and thus the
extension of ρ is onto.

Theorems 3.1 and 3.2 imply that there is a natural one to one relationship between the permutations
of the points P(U) and the symmetries (automorphisms) of <U ∈>. In classical set theory there is only one
point and there is only one symmetry of <U ∈>, namely the identity function.

If there is a symmetry of <U ∈> which maps x to y then x and y are truly indistinguishable. More
concretely let Φ be any first order formula of one free variable whose only non-logical symbol is ∈. If there is
a symmetry of <U ∈> which maps x to y then Φ(x) holds in <U ∈> just in case Φ(y) holds in <U ∈>.

Theorem 2.2 says that for any object x the set P(x) is small and thus P can be thought of as a
mapping from U to U. A simple induction on ∈+ can be used to show that the mapping P commutes with
symmetries of <U ∈>, i.e. that for any symmetry ρ of <U ∈> and any object x, P(ρ(x)) equals ρ(P(x)).

Lemma 3.3: For any symmetry ρ of <U ∈> and any object x, P(ρ(x)) equals ρ(P(x)).

3.2. Structural Isomorphisms 


There is a natural definition for what it means for two graphs, or languages, or lists of points to be 
isomorphic. All of these objects can be represented by elements of U and it would be nice if the notion of 
isomorphism which is defined for elements of U corresponded to the natural notion of isomorphism for such 
objects. This observation leads to the definition of structural isomorphism presented below. 

Any two points have the same structure simply because neither has any structure. Larger objects are
structurally isomorphic just in case there is an identification between the points of the objects which preserves
the structure of those objects. For example let p, q, r, and s be any four distinct points. The set {p q} is
isomorphic to the set {r s}, but not the set {r s p}. In fact any two sets of points are isomorphic just in case
they have the same number of elements. The triple <p p q> is isomorphic to the triple <r r s> but not to the
triple <r s s>.

To define the notion of structural isomorphism precisely it is necessary to build up some
terminology. Let C be any set of points. U(C) is defined to be the set of all elements x of U such that P(x) is a
subset of C. Thus U(C) is the set of objects which are built up out of the points in C. Let ρ be any function
mapping C to arbitrary points. Any such function ρ can be extended to a function ρ' defined on all of U(C)
via the following inductive definition:

ρ'(p) = ρ(p) for points p

ρ'(z) = the representation of {ρ'(y): y∈z} for any non-point z in U(C).

For example if ρ(p) is r and ρ(q) is s then ρ'(<p q>) is <r s>. Thus the function ρ' "replaces" the
points of an object by their image under ρ. In the inductive definition of ρ'(z) the set {ρ'(y): y∈z} is
guaranteed to be represented in <U ∈> because it can be no larger than the set represented by z. In the
following discussion any function ρ defined on the points C will be assumed to be defined in the above way
on all of U(C).
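The inductive extension of a point map to all objects built from those points can be sketched in Python for hereditarily finite objects. This is an illustration under assumptions: points are strings, non-points are non-empty frozensets, the map on points is a dict, and the names pair and extend are hypothetical. The ordered pair uses the encoding {x {x y}} given in section 2.1.

```python
def pair(x, y):
    """The ordered pair <x y>, encoded as the set {x {x y}}."""
    return frozenset({x, frozenset({x, y})})

def extend(rho, z):
    """Apply the point map rho to every point occurring in the object z."""
    if isinstance(z, str):                          # a point: look it up
        return rho[z]
    return frozenset(extend(rho, y) for y in z)     # a set: map its members

rho = {"p": "r", "q": "s"}
print(extend(rho, pair("p", "q")) == pair("r", "s"))   # True
```

Because the image of a set under extend never has more members than the set itself, the toy model reflects the remark that the image is guaranteed to be represented.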

Definition: A structural isomorphism between two elements x and y of U is a bijection ρ
from P(x) to P(y) such that ρ(x) equals y (any function defined on P(x) will be assumed to be
defined on x via the above relation). The elements x and y are said to be structurally
isomorphic just in case there exists a structural isomorphism between them.
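For small finite objects the definition can be tested by brute force over all bijections between the underlying point sets. The sketch below is only an illustration: points are strings, sets are frozensets, and Python tuples are used as a stand-in for the set-theoretic encoding of tuples (an assumption made for readability). The function names are hypothetical.

```python
from itertools import permutations

def points_of(x):
    """P(x) for a toy object built from strings, frozensets, and tuples."""
    if isinstance(x, str):
        return frozenset({x})
    return frozenset().union(*(points_of(y) for y in x))

def relabel(rho, x):
    """Replace every point of x by its image under the dict rho."""
    if isinstance(x, str):
        return rho[x]
    if isinstance(x, tuple):
        return tuple(relabel(rho, y) for y in x)
    return frozenset(relabel(rho, y) for y in x)

def structural_isomorphisms(x, y):
    """Yield every bijection from P(x) to P(y) that carries x to y."""
    px, py = sorted(points_of(x)), sorted(points_of(y))
    if len(px) != len(py):
        return
    for image in permutations(py):
        rho = dict(zip(px, image))
        if relabel(rho, x) == y:
            yield rho

def isomorphic(x, y):
    return next(structural_isomorphisms(x, y), None) is not None

print(isomorphic(frozenset("pq"), frozenset("rs")))    # True
print(isomorphic(frozenset("pq"), frozenset("rsp")))   # False
print(isomorphic(("p", "p", "q"), ("r", "r", "s")))    # True
print(isomorphic(("p", "p", "q"), ("r", "s", "s")))    # False
```

The search is exponential in the number of points and is meant only to make the definition concrete.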

As an example let a group <G ∘> be a pair of a set of points G and a function ∘ from G×G to G
satisfying the standard axioms for a group. Notice that P(<G ∘>) equals G. Now consider two groups <G ∘>
and <G' ∘'> and let ρ be any bijection from G to G'. Clearly ρ(G) equals G' so ρ(<G ∘>) equals <G' ρ(∘)>.
Thus the bijection ρ is a structural isomorphism between <G ∘> and <G' ∘'> just in case ρ(∘) equals ∘'. Since
functions are represented by sets of tuples an element of ∘ is a triple of points <p q r> where r is the value of
p∘q. Thus ρ(∘) is a set of triples of the form <ρ(p) ρ(q) ρ(r)>. The set of triples ρ(∘) will equal the set of
triples ∘' just in case for any triple <p q r> in ∘ the triple <ρ(p) ρ(q) ρ(r)> is in ∘', i.e. just in case ρ(r) equals
ρ(p)∘'ρ(q). This statement is equivalent to the condition that ρ(p)∘'ρ(q) equals ρ(p∘q) which corresponds to
the standard notion of isomorphism between groups.
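The group example can be checked on the two-element group. In the sketch below the operation is stored as a set of triples (p, q, r) with r the value of p∘q, as in the text; the element names, the two candidate bijections, and the helper names are illustrative assumptions. The check confirms that relabeling the triples agrees with the usual homomorphism condition on this example.

```python
def relabel_op(rho, op):
    """Image of a set of triples under the point map rho."""
    return frozenset((rho[p], rho[q], rho[r]) for (p, q, r) in op)

def is_homomorphism(rho, op, op2):
    """Standard condition: rho(p) ∘' rho(q) equals rho(p ∘ q) for all p, q."""
    table2 = {(p, q): r for (p, q, r) in op2}
    return all(table2[(rho[p], rho[q])] == rho[r] for (p, q, r) in op)

# G = {e, a} with a∘a = e, and G' = {one, minus} under multiplication.
op = frozenset({("e", "e", "e"), ("e", "a", "a"),
                ("a", "e", "a"), ("a", "a", "e")})
op2 = frozenset({("one", "one", "one"), ("one", "minus", "minus"),
                 ("minus", "one", "minus"), ("minus", "minus", "one")})

good = {"e": "one", "a": "minus"}    # sends identity to identity
bad = {"e": "minus", "a": "one"}     # a bijection that is not an isomorphism

print(relabel_op(good, op) == op2, is_homomorphism(good, op, op2))  # True True
print(relabel_op(bad, op) == op2, is_homomorphism(bad, op, op2))    # False False
```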

The notion of a structural isomorphism can be related to the symmetries of <U ∈>. It has already
been shown that there is a natural one to one correspondence between the automorphisms of <U ∈> and the
permutations of P(U). The following theorem relates bijections between sets of points and permutations of
P(U).

Theorem 3.4: Any bijection between two small sets of points can be extended to a
permutation of all of P(U).

Proof Sketch: Let C and C' be any two small sets of points and let ρ be any bijection from C to C'.
Let p be any member of either C or C'. Since C and C' need not be disjoint p may be in both C and
C' and thus both ρ(p) and ρ^-1(p) may be defined. In general any point p in either C or C' is
contained in some minimal chain of the form ..., ρ^-1(ρ^-1(p)), ρ^-1(p), p, ρ(p), ρ(ρ(p)), ... (more
precisely the minimal chain containing p is the least subset of C∪C' which contains p and is closed
under ρ and ρ^-1). The minimal chain containing p can be one of four types. First it might be a
loop, in which case ρ is already a permutation of the minimal chain containing p. Second it might
be infinite in both directions, in which case ρ is also already a permutation of the chain. Third the
chain may have a "first" member which is in C but not in C' and a last member which is in C' but
not in C. Finally the minimal chain containing p may have only one endpoint, either a starting
point or an ending point. In these last two cases ρ is not a permutation of the minimal chain containing p.
To remedy this situation one can extend the function ρ to more points and convert any chain of
these last types into either a cycle or a chain which is infinite in both directions. To make such an
extension there must be enough points in P(U) which are not in C or C'. But since both C and C'
are assumed to be small and P(U) is large (axiom six) this last condition can be readily shown.
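When the set of points is finite only two of the four chain types can occur: loops, and chains with both a first and a last member. The following Python sketch covers this finite special case only, which is an assumption of the example and not the general argument: each open chain is closed into a cycle by sending its last member back to its first, and every untouched point is fixed. The function name and the sample data are illustrative.

```python
def extend_to_permutation(rho, all_points):
    """Extend an injective dict rho (from C to C') to a permutation of all_points.

    Finite case only: every open chain start -> ... -> end is closed
    into a cycle by adding end -> start.
    """
    perm = dict(rho)
    images = set(rho.values())
    starts = [p for p in rho if p not in images]   # in C but not in C'
    for start in starts:
        end = start
        while end in rho:                          # follow the chain to its end
            end = rho[end]
        perm[end] = start                          # close the chain into a cycle
    for p in all_points:
        perm.setdefault(p, p)                      # fix every remaining point
    return perm

rho = {"a": "b", "b": "c", "d": "e"}               # chains a->b->c and d->e
perm = extend_to_permutation(rho, "abcdef")
print(perm)
# {'a': 'b', 'b': 'c', 'd': 'e', 'c': 'a', 'e': 'd', 'f': 'f'}
```

In the infinite case a chain with a single endpoint cannot be closed this way, which is where the supply of fresh points outside C and C' is needed.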

The main result of this section can now be proven. 


Theorem 3.5: Two elements x and y of U are structurally isomorphic just in case there is a
symmetry of <U ∈> which maps x to y.

Proof: If x and y are structurally isomorphic then there is a bijection ρ from P(x) to P(y) which
maps x to y. Any extension of ρ to more points will still map x to y. Thus theorem 3.4 implies that
ρ can be extended to a permutation of P(U) which maps x to y. On the other hand if there is a
permutation ρ of P(U) which maps x to y lemma 3.3 implies that P(ρ(x)) equals ρ(P(x)) so ρ(P(x))
equals P(y) and thus ρ maps P(x) onto P(y). Thus the restriction of ρ to P(x) is a bijection from P(x)
to P(y) which maps x to y.

Theorem 3.5 demonstrates that the two natural notions of isomorphism between symmetric sets 
coincide. Thus there is never any ambiguity in what is meant by two elements of U being isomorphic. 


3.3. Symmetry and Contextual Isomorphisms 

Since there is a natural notion of isomorphism for elements of U there is also a natural notion of
automorphism or symmetry. For example there are two structural symmetries of {p q}, the identity map on
the points p and q and the function which exchanges p and q. Since any structural isomorphism can be
extended to a symmetry of <U ∈> there is no need to distinguish between structural isomorphisms and
permutations of P(U).

Definition: The symmetry group of an object x, denoted A(x), is the set of all permutations
ρ of P(U) such that ρ(x) equals x.


There is a contextual notion of isomorphism where two objects x and y are isomorphic in a context z
just in case x and y bear exactly the same relationship to z. For example let z be the set {p q {r s}} where p, q,
r, and s are points. Clearly the point p is just like the point q with respect to z (both p and q are members of z
and neither is a member of a member of z). Similarly the point r is just like the point s with respect to z. On
the other hand p is not like the point r since p is a member of z while r is not.

As another example let y be the set (<p q> <q r> <r p>}. This represents a directed graph with nodes 
p, q, and r and edges from p to q, q to r, and r to p. In other words y represents a cyclic directed graph of 
three nodes. Notice that every node of y looks like every other node. More precisely y has three structural 
symmetries corresponding to three rotations of the graph. For any two nodes there is a rotation which maps 
one to the other. 

Definition: Two objects x and y will be called isomorphic in the context of an object z just in 
case there is a symmetry p of z such that p(x) equals y. The set of things isomorphic to y in 
the context of z will be denoted |y|_z. 
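
The two examples of this section can be checked mechanically. This is a sketch only: the universe of points is cut down to the finitely many points that occur in each example, and the names apply, symmetries, and iso_class are illustrative.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

def symmetries(x, points):
    """All permutations of `points` that map the object x to itself."""
    pts = sorted(points)
    perms = (dict(zip(pts, image)) for image in permutations(pts))
    return [p for p in perms if apply(p, x) == x]

def iso_class(y, z, points):
    """|y|_z: everything of the form p(y) for a symmetry p of z."""
    return {apply(p, y) for p in symmetries(z, points)}

# z = {p q {r s}}: p is like q, and r is like s, but p is not like r.
z = frozenset({"p", "q", frozenset({"r", "s"})})
class_of_p = iso_class("p", z, "pqrs")
class_of_r = iso_class("r", z, "pqrs")

# y = {<p q> <q r> <r p>}: the directed three-cycle and its rotations.
cycle = frozenset({("p", "q"), ("q", "r"), ("r", "p")})
rotations = symmetries(cycle, "pqr")
class_of_node = iso_class("p", cycle, "pqr")
```

In this finite model class_of_p is {p, q}, class_of_r is {r, s}, the cycle has three symmetries, and every node of the cycle lies in the same contextual isomorphism class.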


4. ABSTRACTION, REPRESENTATION, AND OTHER APPLICATIONS 


There is an informal distinction in mathematics between essential and contextual properties. For 
example consider an open set of some topological space. The fact that the set is open is a contextual property 
of that set while the cardinality of the set is an essential property. In the framework of symmetric sets this 
distinction is easily made precise. Any two isomorphic objects have the same essential properties. An 
essential predicate Φ is any predicate on U such that Φ(x) is equivalent to Φ(p(x)) for any object x and point 
permutation p. An essential function is one that commutes with automorphisms of <U €>, i.e. F(p(x)) always 
equals p(F(x)). 

In mathematics one often encounters a notion of a "canonical" or "natural" transformation or 
relationship. For example a set of points has no natural or canonical element, there is no natural or canonical 
point on a circle, and a square has no canonical corner. On the other hand one can choose a canonical 
element of an ordered pair. One particularly well known example is the dual space of a linear vector space. 
The dual space D(X) of a linear vector space X is the set of linear functions from X to scalars. If there is a dot 
product operation * defined on X then there is a natural isomorphism between X and D(X) where the linear 
function associated with a vector x is λy.x*y. However if no dot product operation is specified for X then 
while X and D(X) are still isomorphic there is no natural or canonical isomorphism. On the other hand there 
is always a canonical or natural isomorphism between X and D(D(X)). A simple and natural definition for 
this notion of canonical is given by saying that y is canonical in the context of x just in case the isomorphism 
class of y in the context of x, |y|_x, contains only one object. 

The theory of symmetric sets provides a simple general measure of the abstractness of objects. An 
object y can be said to be an abstraction of an object x just in case any one of the following three conditions 
holds: A(y) contains A(x), |y|_x is a singleton set, or y equals F(x) for some essential function F. It turns out that 
these three criteria are equivalent and there is no ambiguity in what is meant by y being an abstraction of x. 

There are many representation theorems in mathematics. For example every Boolean algebra can 
be represented by an algebra of sets. Of course the notion of a representation is also heavily used in computer 
science where alphabets are represented as binary codes and sets are represented as lists. This raises the 
natural question of what is meant in general by a representation. The theory of symmetric sets provides a 
natural framework in which to develop a general theory of representation. 

Mathematicians often talk about fixed but arbitrary structures which form a context in which to 
investigate other structures. For example the natural numbers are assumed to be a fixed set even though their 
"true identity" can never be specified. The same holds for the real and complex numbers. Another example 
of contextual objects are the fixed constants "true" and "false" which are used in discussions of logic. Still 
another example from logic is the fixed but arbitrary alphabet from which the sentences of logic are 
constructed. Even the "empty set" can be viewed as an object which is taken to be fixed but whose identity is 
never specified. The general notion of a context can be handled in a natural way in the theory of symmetric 
sets. A context is an object z (which may have lots of internal structure) which is "taken to be fixed". This 
means that only symmetries of z are considered when talking about isomorphisms, symmetry groups, essential 
properties, and canonical objects. 

4.1. Essential Functions, Canonicality, and Abstraction 


The set {p q} can be thought of as an essential property of the pair <p q>. That is to say that given a 
pair of points one can derive in a natural way a set of two points. The reverse does not seem to hold: given a 
set of two points there is no natural or canonical way to derive a pair of two points. Similarly given a point p 
there is no natural or canonical set of two points which contains p. Conversely given a set of two points there 
is no natural or canonical element of that set. Recall that for any objects y and z, |y|_z is the set of all things 
isomorphic to y in the context of z, i.e. |y|_z is the set of things which can be written as p(y) for some symmetry 
p of z. 


Definition: An object y will be called canonical in the context of an object z just in case |y|_z 
is a singleton set. 
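
The informal claims about pairs and two element sets can be tested against this definition. The sketch below assumes a universe of only three points p, q, and r, and the function names are illustrative.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

def iso_class(y, z, points):
    """|y|_z: the images of y under the symmetries of z."""
    pts = sorted(points)
    perms = (dict(zip(pts, image)) for image in permutations(pts))
    return {apply(p, y) for p in perms if apply(p, z) == z}

def is_canonical(y, z, points):
    """y is canonical in the context of z when |y|_z is a singleton."""
    return len(iso_class(y, z, points)) == 1

POINTS = "pqr"                      # assumed finite stand-in for P(U)
pair = ("p", "q")                   # the pair <p q>
two_set = frozenset({"p", "q"})     # the set {p q}

set_from_pair = is_canonical(two_set, pair, POINTS)     # pair  -> set
pair_from_set = is_canonical(pair, two_set, POINTS)     # set   -> pair
element_of_set = is_canonical("p", two_set, POINTS)     # set   -> element
element_of_pair = is_canonical("p", pair, POINTS)       # pair  -> element
set_from_point = is_canonical(two_set, "p", POINTS)     # point -> set
```

Here the set {p q} and the point p are canonical in the context of <p q>, while neither the pair <p q> nor the point p is canonical in the context of {p q}, and {p q} is not canonical in the context of p.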

There are many intuitive examples of objects which are not natural or canonical. There is no natural 
or canonical point on the perimeter of a circle. There is no canonical corner on a square. There is no natural 
or canonical coordinate system for three dimensional space. On the other hand consider a scalene triangle 
(where no two sides have the same length). One can choose a canonical vertex for such a triangle by choosing 
the vertex connecting the two shorter sides. 

There is another intuitively satisfying notion of what a canonical object is. Intuitively y is canonical 
in the context of z if one can define a function which takes z and unambiguously returns y. This notion of a 
canonical object is problematic because for any two objects z and y there is a function which maps z to y. 
However one wants a definable function. Remember that the universe <U €> has many non-trivial 
automorphisms and every function from U to U which is defined in terms of the structure of the universe 
<U €> must respect those automorphisms. In particular any function F defined in terms of the structure of 
<U €> must be essential in the following sense: 

Definition : A function F from U to U will be called essential just in case for every symmetry 
p of <U €> and every element x of U, F(p(x)) equals p(F(x)). 
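
A finite spot check can separate functions that commute with point permutations from functions that do not. The sketch below is an illustration, not a proof: it tests only a finite sample of objects and the permutations of three points, and the functions pair_to_set and pick_least are hypothetical examples chosen for this purpose.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

def commutes_on(F, objects, points):
    """Check F(p(x)) == p(F(x)) for every sampled x and permutation p."""
    pts = sorted(points)
    perms = [dict(zip(pts, image)) for image in permutations(pts)]
    return all(F(apply(p, x)) == apply(p, F(x)) for p in perms for x in objects)

def pair_to_set(pair):
    """Forget the order of a pair: <a b> becomes {a b}."""
    return frozenset(pair)

def pick_least(point_set):
    """Pick the alphabetically least point, a choice that depends on names."""
    return min(point_set)

PTS = ["p", "q", "r"]
pairs = [(a, b) for a in PTS for b in PTS if a != b]
sets = [frozenset(pr) for pr in pairs]

pair_to_set_ok = commutes_on(pair_to_set, pairs, PTS)
pick_least_ok = commutes_on(pick_least, sets, PTS)
```

The first function passes the check on this sample, while the second fails because exchanging p and q leaves {p q} fixed but moves the chosen point.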

Intuitively the image of an object x under an essential function F can contain no more information 
than the object x itself. Thus it might be said that F(x) is an abstraction of x. 

Definition: An object y will be called an abstraction of an object z just in case y equals F(z) 
for some essential function F. 

The main result of this section relates the notion of canonicality, the notion of abstraction, and the 
symmetry groups of objects. 

Theorem 4.1, The First Abstraction Theorem: For any objects y and z the following are 
equivalent: 

1) A(y) contains A(z). 

2) |y|_z is a singleton set. 

3) y equals F(z) for some essential function F. 

Proof: 

1 <=> 2: If A(y) contains A(z) then for any symmetry p of z, p(y) equals y so |y|_z contains only 
y. On the other hand if |y|_z is a singleton set then p(y) equals y for any symmetry p of z so A(z) is a 
subset of A(y). 

1 => 3: Given that A(y) contains A(z) an essential function F from U to U can be defined as 
follows: Of course F(z) is defined to be y. For any w which is isomorphic to z let p be some point 
permutation such that p(z) equals w and define F(w) to be p(y) (this is equivalent to defining 
F(p(z)) to be p(F(z)) for any point permutation p). It must be shown that this definition of F(w) is 
independent of the choice of p. In particular if p and p' are two point permutations such that p(z) 
equals p'(z) equals w then it must be shown that p(y) equals p'(y). If p(z) equals p'(z) then 
p^-1(p'(z)) equals z so p^-1°p' is a symmetry of z. But since A(z) is a subset of A(y), p^-1°p' is a 
symmetry of y and thus p^-1(p'(y)) equals y which implies that p'(y) equals p(y). 

It will now be shown that for any w which is isomorphic to z and any point permutation p, 
F(p(w)) equals p(F(w)). Since w is isomorphic to z there is some point permutation p' such that w 
equals p'(z). Now F(p(w)) equals F(p°p'(z)) which equals p°p'(F(z)) which equals p(p'(F(z))) 
which equals p(F(p'(z))) which equals p(F(w)). Thus for any w isomorphic to z, F satisfies the 
condition for being an essential function. To complete the definition of F let F(w) be w for any w 
not isomorphic to z. 

3 => 1: Suppose F is an essential function such that F(z) equals y and let p be any symmetry of 
z. Since F is an essential function p(F(z)) equals F(p(z)) which equals F(z). Thus p is a symmetry 
of F(z), i.e. p is a symmetry of y. Thus A(y) contains A(z). 
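
The construction used for 1 => 3 can be replayed on a finite example. The following sketch assumes a universe of three points and builds F only on the objects isomorphic to z; build_F returns None when two permutations with the same value on z disagree on y, which is exactly the failure of well-definedness that the proof rules out when A(y) contains A(z). All names are illustrative.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

PTS = ["p", "q", "r"]                            # assumed finite stand-in for P(U)
PERMS = [dict(zip(PTS, image)) for image in permutations(PTS)]

def contains(y, z):
    """Condition 1: every symmetry of z is also a symmetry of y."""
    return all(apply(p, y) == y for p in PERMS if apply(p, z) == z)

def build_F(y, z):
    """Define F(p(z)) = p(y) on the objects isomorphic to z, if consistent."""
    table = {}
    for p in PERMS:
        w, image = apply(p, z), apply(p, y)
        if table.setdefault(w, image) != image:
            return None                          # F would not be well defined
    return table

pair, two_set = ("p", "q"), frozenset({"p", "q"})

F = build_F(two_set, pair)        # A({p q}) contains A(<p q>)
G = build_F(pair, two_set)        # A(<p q>) does not contain A({p q})

# On its domain F commutes with every permutation of the three points.
F_commutes = all(F[apply(p, w)] == apply(p, F[w]) for p in PERMS for w in F)
```

In this model F sends every pair of distinct points to the corresponding two element set, while G is None because the exchange of p and q fixes {p q} but moves <p q>.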


4.2. Abstraction and Points 


The notion of abstraction has some important relationships to points. The first lemma about points 
concerns the isomorphism classes of points in the context of an object x. 

Lemma 4.2: For any object x and point p, |p|_x is either a subset of P(x) or is all of P(U)-P(x). 

Proof: For any symmetry p of x, p(P(x)) equals P(p(x)) equals P(x). Thus any symmetry p of x 
induces a permutation of P(x). Thus for any point p in P(x), |p|_x is a subset of P(x). On the other 
hand for any two points r and s which are not in P(x) there is a symmetry of x which exchanges r 
and s. Thus if r is not in P(x) then |r|_x contains all of P(U)-P(x). Furthermore for r not in P(x), |r|_x 
can not intersect P(x) since otherwise there would be some point p in P(x) such that |p|_x was not a 
subset of P(x). Thus either p is in P(x) and |p|_x is a subset of P(x) or p is not in P(x) and |p|_x equals 
P(U)-P(x). 
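
The dichotomy of Lemma 4.2 can be observed in a small model. This sketch assumes a universe of five points; the notion of a "small" set used in the next corollary has no counterpart in such a finite model, so only the lemma itself is illustrated. The helper names are illustrative.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

def points_of(x):
    """P(x): the points occurring anywhere inside x."""
    if isinstance(x, str):
        return {x}
    return set().union(*(points_of(m) for m in x))

UNIVERSE = {"p", "q", "r", "s", "t"}             # assumed finite stand-in for P(U)
PTS = sorted(UNIVERSE)
PERMS = [dict(zip(PTS, image)) for image in permutations(PTS)]

def iso_class(y, z):
    """|y|_z computed over the finite universe."""
    return {apply(p, y) for p in PERMS if apply(p, z) == z}

x = frozenset({"p", frozenset({"q", "r"})})      # the object {p {q r}}
inside = points_of(x)
outside = UNIVERSE - inside

classes = {pt: iso_class(pt, x) for pt in UNIVERSE}
lemma_holds = all(
    (pt in inside and classes[pt] <= inside)
    or (pt not in inside and classes[pt] == outside)
    for pt in UNIVERSE
)
```

For this x the class of p is {p}, the classes of q and r are both {q, r}, and the classes of s and t are both {s, t}, which is all of the points outside x.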

Corollary 4.3: For any object x and point p, p is in P(x) just in case |p|_x is small. 

Corollary 4.3 immediately implies that for any object x the set P(x) is determined by the symmetry 
group of x. In fact Corollary 4.3 leads directly to the following theorem: 

Theorem 4.4: For any objects y and x, if y is an abstraction of x then P(y) is a subset of P(x). 

Proof: Let p be any point in P(y). Since A(y) contains A(x), |p|_y contains |p|_x. But by Corollary 4.3 
|p|_y is small so |p|_x must also be small and therefore p must be in P(x). 

Theorem 4.4 immediately implies that for any essential function F and any object x, P(F(x)) 
is a subset of P(x). It is important to realize that the converse of Theorem 4.4 does not hold, i.e. if P(y) is a 
subset of P(x) then y need not be an abstraction of x. For example {p q} is not an abstraction of {p q r}. 
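
Both the theorem and the failure of its converse can be checked on a finite sample. One caveat should be stated loudly: the theorem depends on P(U) being large, and in a finite model it can fail when the universe has too few spare points. The sketch therefore assumes five points and restricts the sample to objects with at most two points each, so that a spare point always exists. The names are illustrative.

```python
from itertools import combinations, permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

def points_of(x):
    """P(x): the points occurring anywhere inside x."""
    if isinstance(x, str):
        return {x}
    return set().union(*(points_of(m) for m in x))

PTS = ["p", "q", "r", "s", "t"]                  # assumed finite stand-in for P(U)
PERMS = [dict(zip(PTS, image)) for image in permutations(PTS)]

def is_abstraction(y, x):
    """A(y) contains A(x): every symmetry of x also fixes y."""
    return all(apply(p, y) == y for p in PERMS if apply(p, x) == x)

# The converse fails: P({p q}) is a subset of P({p q r}), yet no abstraction.
pq, pqr = frozenset("pq"), frozenset("pqr")
converse_fails = points_of(pq) <= points_of(pqr) and not is_abstraction(pq, pqr)

# The theorem itself, checked on points, small sets, and pairs of points.
family = (PTS
          + [frozenset(c) for n in (1, 2) for c in combinations(PTS, n)]
          + [(a, b) for a in PTS for b in PTS])
theorem_holds = all(points_of(y) <= points_of(x)
                    for x in family for y in family if is_abstraction(y, x))
```

Any permutation that sends p to r while fixing the set {p q r} moves {p q}, which is why the converse fails.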

4.3. Representation and Transformational Isomorphisms 


Any finite set of points can be represented by a finite list of points though there is no canonical 
representation for a set of points as a list of points. More precisely there is an essential function F which maps 
any finite list of points to the finite set of points contained in that list and any finite set y can be written as 
F(x) for some finite list x. Note that the function F is from the representations to the represented objects. 
Also note that the function from representations to represented objects is onto, i.e. every object which is to be 
represented must have a representation. These observations lead to the following definition of a uniform 
representation. 

Definition: Let C and R be subsets of U, let F be an essential function, and let F(R) denote 
{F(x) for x in R}. F is said to be a uniform representation of elements of C as elements of R 
just in case F(R) contains C. 

Note that lists of points can not be represented as sets of points because there is no essential function 
which maps a set of points to a list of those points (there is no canonical representation of a set of points as a 
list). A similar example involves multisets. A list of points can be used to represent a multiset of points, but 
multisets of points can not be used to represent lists. 
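
The asymmetry between lists and sets can be illustrated in a small model. The sketch below assumes three points and lists of length at most three, models lists as Python tuples, and uses the illustrative name to_set for the function from lists to sets. It checks that to_set commutes with permutations and is onto the sets of these points, and that no list of the points of {p q} is canonical in the context of {p q}.

```python
from itertools import combinations, permutations, product

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a list, modelled as a tuple

PTS = ["p", "q", "r"]                            # assumed finite stand-in for P(U)
PERMS = [dict(zip(PTS, image)) for image in permutations(PTS)]

lists = [t for n in range(4) for t in product(PTS, repeat=n)]
finite_sets = [frozenset(c) for n in range(4) for c in combinations(PTS, n)]

def to_set(lst):
    """The representation map F: a list goes to the set of its points."""
    return frozenset(lst)

commutes = all(to_set(apply(p, x)) == apply(p, to_set(x))
               for p in PERMS for x in lists)
onto = all(any(to_set(x) == s for x in lists) for s in finite_sets)

# In the other direction: every list of exactly the points of {p q} is moved
# by some symmetry of {p q}, so none of them is canonical in that context.
s = frozenset({"p", "q"})
candidates = [x for x in lists if to_set(x) == s]
none_canonical = all(
    len({apply(p, x) for p in PERMS if apply(p, s) == s}) > 1
    for x in candidates
)
```

By the First Abstraction Theorem the last check is what prevents an essential function from sending {p q} to any one of the candidate lists.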

There are certain cases in mathematics where two different (non-isomorphic) things are "essentially 
the same thing". For example an equivalence relation on a set of points C is a relation, i.e. a set of pairs, 
which is reflexive, symmetric, and transitive. A partition of C is a family of disjoint subsets of C. Any 
equivalence relation on C can be viewed as a partition of C and vice versa. Another simple example involves 
the representation of tuples. For example a tuple of points <p q> can be viewed as the set {p {p q}} or as the 
set {q {p q}}. The following definition makes the notion of "essentially the same" more concrete. 

Definition: An essential function F from U to U which is also a permutation of U will be 
called a transformational symmetry of <U €>. Two objects x and y will be called 
transformationally isomorphic just in case there is a transformational symmetry of <U €> 
which maps x to y. 

If two objects x and y are transformationally isomorphic then there is a sense in which they are 
indistinguishable. More precisely the following lemma holds: 

Lemma 4.5: For any transformational symmetry F there is a translation operator T which 
maps any monadic essential predicate Φ to a monadic essential predicate T(Φ) such that for 
any object x, Φ(x) holds just in case T(Φ)(F(x)) holds. 

Proof: Since F is a permutation of U it has an inverse F^-1 which is easily shown to be an essential 
function. Let T(Φ) be the predicate λw.Φ(F^-1(w)). Clearly Φ(x) is equivalent to T(Φ)(F(x)). 
Furthermore T(Φ) is easily seen to be an essential predicate. 

Lemma 4.5 gives a precise relationship between objects which are transformationally isomorphic. In 
particular if x and y are transformationally isomorphic via the transformational symmetry F then any essential 
statement (or question) Φ(x) concerning x is equivalent to some essential statement T(Φ)(y) concerning y. It 
turns out that two objects x and y are transformationally isomorphic just in case A(x) equals A(y). However 
the condition that A(x) equals A(y) does not ensure that there is a definable transformational symmetry F 
which maps x to y. 
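
The symmetry group criterion can be checked for the two motivating examples of this section. The sketch below assumes small finite collections of points and compares symmetry groups as sets of permutations; to_partition is an illustrative helper which turns an equivalence relation into its partition.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <p q>

def sym_group(x, points):
    """A(x) over a finite point collection, as a set of hashable permutations."""
    pts = sorted(points)
    perms = (dict(zip(pts, image)) for image in permutations(pts))
    return {tuple(sorted(p.items())) for p in perms if apply(p, x) == x}

def to_partition(relation):
    """View an equivalence relation (a set of pairs) as a partition."""
    return frozenset(
        frozenset(b for a2, b in relation if a2 == a) for a, _ in relation
    )

# The tuple <p q> and the set {p {p q}} have the same symmetries.
pair = ("p", "q")
as_set = frozenset({"p", frozenset({"p", "q"})})
same_for_tuple = sym_group(pair, "pqr") == sym_group(as_set, "pqr")

# An equivalence relation on {a b c} and its partition {{a b} {c}}.
relation = frozenset({("a", "a"), ("b", "b"), ("c", "c"), ("a", "b"), ("b", "a")})
partition = to_partition(relation)
same_for_partition = sym_group(relation, "abc") == sym_group(partition, "abc")
```

In both cases the two objects are different sets but have identical symmetry groups in the finite model, which matches the criterion A(x) equals A(y).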

Definition: A function F from U to U will be called definable just in case there is a first order 
formula Φ of two free variables whose only non-logical symbol is € such that for any two 
objects x and y, Φ(x y) holds in <U €> just in case y equals F(x). 

Consider the real numbers <R + * <> where R is a set of points, + and * are binary operations on 
R, and < is a total order on R. A(<R + * <>) is the group of all symmetries which leave every point in R 
fixed (any symmetry of <R + * <> must leave one and zero fixed). Now consider a pair <R <’> where <’ is 
a well ordering of the set of points R. It is easy to show that A(<R <’>) is also the group of all permutations 
which leave every point in R fixed. Since A(<R + * <>) equals A(<R <’>), <R + * <> is transformationally 
isomorphic to <R <’>. However there is probably no definable transformational symmetry of <U €> which 
maps <R + * <> to <R <’>. 

4.4. Context 


Intuitively a context is a collection of objects which are taken to be "fixed". There are some objects 
which are taken to be fixed over all of mathematics. For example mathematicians often speak of "the" 
natural numbers, even though the identity of the natural numbers can not be specified (though the structure 
of the numbers can be specified up to isomorphism). The same holds for "the" real numbers, or "the" 
complex plane. In logic one often assumes that there is a particular thing which is the constant "true" and a 
particular thing which is the constant "false". A more controversial example is the empty set. There are other 
examples of "context" where the context is not even specified up to isomorphism. The phrase "fixed but 
arbitrary" is often used in mathematical writing and serves to specify a context for a mathematical discussion. 

As another example of context consider a linear vector space. A linear vector space has an associated 
field (usually "the" real or complex numbers) such that any vector can be scaled by an element of the field. 
In discussions of linear vector spaces the field is usually taken to be fixed. Thus in relating two vector spaces 
one usually assumes they have the same associated field of scalars. 

As yet another example consider a particular first order language L. The language L is taken to be a 
sequence of typed symbols which determines a set of well formed formulas. Such a language is usually taken 
to be arbitrary but fixed in discussions of logic. 

To generalize the results of the previous section it is useful to define the notion of a contextual 
symmetry group. 

Definition: The symmetry group of y in the context of z, denoted A_z(y), is the set of all 
symmetries of z which are also symmetries of y. More simply A_z(y) equals A(y) intersect 
A(z). 

Note that any symmetry of the pair <y z> must be a symmetry of y and a symmetry of z, and 
anything which is a symmetry of both y and z must be a symmetry of the pair <y z>. Thus A(<y z>) is the 
intersection of A(y) and A(z) so A_z(y) equals A(<y z>). The notion of an essential function can also be made 
contextual: 

Definition: A function F from U to U will be called essential in the context of z just in case 
for any object x and any symmetry p of z, F(p(x)) equals p(F(x)). 

A good example of a contextually essential function is the cardinality function on finite sets. Let 
<N <> be "the" natural numbers where N is a set of points and < is a binary relation which orders those 
points. The function F which maps any finite set x to the natural number representing the size of x is essential 
in the context of <N <>. Note that this cardinality function is not essential outside of this context because 
F(x) can be a point not found in x. Lemma 4.7 provides an alternative characterization of functions 
which are contextually essential. 
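
The cardinality example can be tried out in a toy model. The sketch below is built on assumptions: "the" natural numbers are cut down to three points n0, n1, n2 with their order, two further points p and q serve as the things being counted, and the names naturals and card are illustrative.

```python
from itertools import combinations, permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <N <>

NUMBERS = ["n0", "n1", "n2"]                     # assumed fragment of N
OTHERS = ["p", "q"]
ALL = NUMBERS + OTHERS
PERMS = [dict(zip(ALL, image)) for image in permutations(ALL)]

# The context <N <>: the set of number points together with its order.
N = frozenset(NUMBERS)
LESS = frozenset((NUMBERS[i], NUMBERS[j]) for i in range(3) for j in range(i + 1, 3))
naturals = (N, LESS)

def card(x):
    """Map a finite set to the number point representing its size."""
    return NUMBERS[len(x)]

finite_sets = [frozenset(c) for n in range(3) for c in combinations(OTHERS, n)]

context_syms = [p for p in PERMS if apply(p, naturals) == naturals]
essential_in_context = all(card(apply(p, x)) == apply(p, card(x))
                           for p in context_syms for x in finite_sets)
essential_everywhere = all(card(apply(p, x)) == apply(p, card(x))
                           for p in PERMS for x in finite_sets)
```

Every symmetry of the context fixes each number point, so card commutes with those symmetries, whereas a permutation which exchanges n0 and n1 breaks the equation for the empty set.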

Lemma 4.7: A function F from U to U is essential in the context of z just in case it can be 
written as λx.G(<x z>) for some essential function G. 

Proof: If F can be written as λx.G(<x z>) then it is easy to show that F is essential in the context of 
z. On the other hand assume F is essential in the context of z. First if w is not a pair whose second 
component is isomorphic to z then G(w) is defined to be w. If w is a pair whose second component 
is isomorphic to z then w can be written as p(<x z>) for some object x and point permutation p. In 
this case G(w), which can be written as G(p(<x z>)), is defined to be p(F(x)). It must first be shown 
that G is well defined, i.e. that if p(<x z>) equals p'(<y z>) then p(F(x)) equals p'(F(y)). First note 
that p^-1(p'(<y z>)) equals <x z> so p^-1(p'(y)) equals x and p^-1(p'(z)) equals z. Thus p^-1°p' is a 
symmetry of z. Now since F is essential in the context of z, p^-1°p'(F(y)) equals F(p^-1°p'(y)) which 
equals F(x). But if F(x) equals p^-1°p'(F(y)) then p(F(x)) must equal p'(F(y)). 

It follows directly from the definition of G that G(<x z>) equals F(x) and thus F can be written 
as λx.G(<x z>). To show that G commutes with arbitrary point permutations let w be any object 
and p be any point permutation. If w can not be written as p'(<x z>) for some x and p' then G(w) 
equals w and G(p(w)) equals p(w) so the result is trivial. On the other hand suppose w can be 
written as p'(<x z>). In this case G(p(w)) equals G(p°p'(<x z>)) which equals p°p'(F(x)) which 
equals p(G(p'(<x z>))) which equals p(G(w)). 

The following abstraction theorem is a generalization of the first abstraction theorem. 

Theorem 4.8, The Second Abstraction Theorem: For any objects x and y and context z the 
following are equivalent: 

1) A_z(y) contains A_z(x). 

2) |y|_<x z> is a singleton set. 

3) y equals F(x) for some function F which is essential in the context of z. 

Proof: The first condition is equivalent to the statement that A(<y z>) contains A(<x z>). The 
second condition is equivalent to the statement that |<y z>|_<x z> is a singleton set. Finally Lemma 
4.7 implies that the third condition is equivalent to the statement that <y z> equals G(<x z>) for 
some essential function G. Thus the equivalence of these three statements follows directly from the 
first abstraction theorem. 
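
A small example shows how a context can turn a non-canonical choice into a canonical one. The sketch assumes three points and uses an example chosen for illustration: x is the set {p q}, the context z is the point p, and y is the point q, i.e. "the other element" of x. It also checks the identity A(<y z>) = A(y) intersect A(z) used in the reduction to the first abstraction theorem. All names are illustrative.

```python
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a tuple such as <x z>

PTS = ["p", "q", "r"]                            # assumed finite stand-in for P(U)
PERMS = [dict(zip(PTS, image)) for image in permutations(PTS)]

def sym_group(x):
    """A(x) over the finite universe, as a set of hashable permutations."""
    return {tuple(sorted(p.items())) for p in PERMS if apply(p, x) == x}

def context_group(y, z):
    """A_z(y): the symmetries of z which are also symmetries of y."""
    return sym_group(y) & sym_group(z)

def iso_class(y, z):
    """|y|_z over the finite universe."""
    return {apply(p, y) for p in PERMS if apply(p, z) == z}

x, y, z = frozenset({"p", "q"}), "q", "p"

pair_identity = sym_group((y, z)) == context_group(y, z)
condition_1 = context_group(y, z) >= context_group(x, z)     # A_z(y) contains A_z(x)
condition_2 = len(iso_class(y, (x, z))) == 1                 # |y|_<x z> is a singleton
without_context = sym_group(y) >= sym_group(x)               # A(y) contains A(x)
```

Conditions 1 and 2 both hold in the context of p, while without the context q is not an abstraction of {p q} because the exchange of p and q fixes the set but moves q.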


4.5. Essential Predicates 


Essential functions have been shown to play an important role in characterizing the nature of 
abstractions and the notion of a natural or canonical property. Essential predicates are closely related to 
essential functions and can play much the same role in constructing abstractions. 


Definition: A binary predicate Φ on U is called essential if for any objects x and y and point 
permutation p, Φ(x y) holds just in case Φ(p(x) p(y)) holds. 

The relationship between predicates and functions can be made more explicit by defining a monadic 
function F_Φ for each binary predicate Φ. 

Definition: For any binary predicate Φ on U, F_Φ is the function from U to subsets of U such 
that F_Φ(x) equals {y: Φ(x y)}. 

For example let Φ be the predicate such that Φ(x y) holds just in case y is a pair whose first 
component is x. In this case F_Φ(x) is the set of all pairs whose first component is x. Note that F_Φ(x) is a large 
set and thus has no representation in <U €>. Thus in general F_Φ can not be thought of as a function from U 
to U. The following definitions will be useful in discussing the functions associated with essential predicates. 


Definition: Let C be any subset of U. P(C) is the union over x in C of P(x). For any point 
permutation p, p(C) is the set {p(x): x in C}. The symmetry group of C, denoted A(C), is 
the set of all point permutations p such that p(C) equals C. The isomorphism class |C|_z of C 
in the context of an object z is the family of all sets which can be written as p(C) for some 
symmetry p of z. 

For any essential predicate Φ the function F_Φ commutes with point permutations, i.e. F_Φ(p(x)) 
equals p(F_Φ(x)). Thus F_Φ can be thought of as an essential function. However as the above example shows 
F_Φ(x) can be large and P(F_Φ(x)) can be all of P(U). Thus some of the theorems concerning essential 
functions do not apply to F_Φ. However many of the results concerning essential functions can be generalized 
to the functions associated with essential predicates. 

Theorem 4.9, The Third Abstraction Theorem: For any object z and (possibly large) subset C 
of U the following are equivalent: 

1) A(C) contains A(z). 

2) |C|_z is a singleton family. 

3) C equals F_Φ(z) for some essential predicate Φ. 

The proof of the above theorem is analogous to the proof of the first abstraction theorem. In 
showing that 1) implies 3) the predicate Φ is defined by setting Φ(p(z) p(y)) to be true for any point 
permutation p and any element y of C, and Φ(w x) to be false if <w x> can not be written as <p(z) p(y)> for 
some y in C. 

Essential predicates can be thought of as defining abstraction functions from objects to more 
abstract objects. For an essential predicate Φ and object x it does not seem very important that F_Φ(x) may be 
large, the important point is that the symmetry group of F_Φ(x) contains the symmetry group of x. 

A good example of the use of essential predicates in defining abstractions is a multiset. Let f and g 
be two finite functions (they each have a finite domain). The functions f and g will be said to represent the 
same multiset if there exists a bijection σ from the domain of f to the domain of g such that for any x in the 
domain of f, f(x) equals g(σ(x)). Intuitively f and g represent the same multiset if for any range element y, f 
and g map the same number of objects onto y. Let Φ be the binary predicate on U such that Φ(f g) holds just 
in case f and g are finite functions which represent the same multiset. It is easy to show that Φ is an 
equivalence relation on finite functions and that F_Φ(f) is the equivalence class of f under this relation. The 
symmetry group of F_Φ(f) is not the full permutation group on P(U) but is larger than the symmetry group of 
f. Thus F_Φ(f) can be thought of as the multiset abstraction of f. 
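
The multiset example can be made concrete in a finite model. In the sketch below a finite function is modelled as a frozenset of pairs <argument value>, and same_multiset is an illustrative implementation of Φ which assumes its inputs really are functions. Since Φ is an equivalence relation and F_Φ commutes with point permutations, a permutation p fixes the class F_Φ(f) exactly when f and p(f) represent the same multiset, and that is how the symmetry group of the class is computed here without enumerating the (large) class itself. Five points are assumed.

```python
from collections import Counter
from itertools import permutations

def apply(perm, x):
    """Apply a point permutation (dict: point -> point) to an object."""
    if isinstance(x, str):                       # a point
        return perm.get(x, x)
    if isinstance(x, frozenset):                 # a set
        return frozenset(apply(perm, m) for m in x)
    return tuple(apply(perm, m) for m in x)      # a pair <argument value>

def same_multiset(f, g):
    """The predicate: f and g take each value the same number of times."""
    return Counter(v for _, v in f) == Counter(v for _, v in g)

PTS = ["a", "b", "c", "x", "y"]                  # assumed finite stand-in for P(U)
PERMS = [dict(zip(PTS, image)) for image in permutations(PTS)]

f = frozenset({("a", "x"), ("b", "x"), ("c", "y")})   # represents {x x y}
g = frozenset({("c", "x"), ("a", "y"), ("b", "x")})   # also represents {x x y}
h = frozenset({("a", "x"), ("b", "y")})               # represents {x y}

samples = [f, g, h]
predicate_is_essential = all(
    same_multiset(apply(p, u), apply(p, v)) == same_multiset(u, v)
    for p in PERMS for u in samples for v in samples
)

syms_of_f = [p for p in PERMS if apply(p, f) == f]
syms_of_class = [p for p in PERMS if same_multiset(f, apply(p, f))]
```

In this model f has two symmetries, its multiset class has six, and the full permutation group on the five points has one hundred and twenty, which matches the ordering of groups described above.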


5. RELATION TO OTHER WORK 


The notions of isomorphism, symmetry, and representation are ubiquitous in mathematics and 
probably have numerous independent origins. The relationship between symmetry and permutation groups 
is also well known. This relationship has been studied in some detail and it has been shown, for example, that 
not every permutation group can be represented as the symmetry group of a graph [Biggs 74]. But while the 
notions of isomorphism and symmetry have been extensively used for objects of a given type (e.g. graphs, 
groups, algebras, languages, grammars) these particular notions of isomorphism do not provide a notion of 
isomorphism defined over all mathematical objects. 

Category theory provides one general approach to the notion of isomorphism. A category can be 
thought of as a directed multigraph with an associative composition operator ° on arcs and for each node n an 
assigned "identity" arc from n to n. The nodes of a category are often associated with sets and the arcs with 
functions between these sets. Thus the arcs are called "morphisms". An isomorphism is defined to be an arc 
p which has an "inverse" arc p^-1 such that both p°p^-1 and p^-1°p are identity arcs [Schubert 72]. Category 
theory provides a general theory of isomorphism to the extent that every mathematical object can be thought 
of as a node ("object") in a category. For example a group can be thought of as a node in the category of 
groups, a graph as a node in the category of graphs, etc. However the category containing a given object must 
be defined separately for each type of object. In fact the category containing an object of a given type is 
usually defined in terms of the notion of isomorphism (and homomorphism) for objects of that type. 
Therefore category theory does not provide any satisfying general notion of isomorphism for arbitrary 
mathematical objects. 

The notion of a type used in universal algebra and computer science provides another approach to a 
general definition of isomorphism. An algebra is a domain together with some functions defined over that 
domain. In the universal study of algebras each particular algebra has a type (or signature or language) which 
is a set of symbols which are interpreted by that algebra. For example the type of a group is the single binary 
function symbol °. There is a natural definition of isomorphism for the algebras of a fixed type such that two 
algebras A and B are isomorphic just in case there is a bijection from the domain of A to the domain of B 
which maps A’s interpretation of any symbol to B’s interpretation of that symbol. 

The notion of type also plays a critical role in many modern computer languages [Tennant 81]. 
There is one particular outlook on the types of computer data structures which provides a basis for a notion of 
isomorphism. Under this view a type is a collection of objects which can be defined in a "natural" way from a 
collection of base types. For example if A and B are base types then the set of functions from A to B is also a 
type. Similarly the set of pairs A×B of an element of A and an element of B is a type. As another example let 
the type SubSets(A×A) be the collection of all sets of ordered pairs of elements of the base type A. More 
simply SubSets(A×A) is the type containing all directed graphs whose nodes are members of A. Several 
people studying such data types have employed permutations of the elements of base types to define a 
notion of a "natural" function between types [Aho & Ullman 79] [Dunlaing & Yap 82]. In fact Dunlaing and 
Yap implicitly use a notion of isomorphism based on permutations of the elements of base types in defining 
the automorphism group of an arbitrary typed object. 

While there are strong similarities between symmetric set theory and the above mentioned work on data 
types (especially that of Dunlaing and Yap) there is also an important difference. Symmetric set theory bases 
the notion of isomorphism on points rather than types. When the notion of isomorphism is based on points 
the isomorphism class of an object is an essential property of the object and does not depend on viewing that 
object as an instance of some type (or as a member of some category). Thus it can be argued that the need for 
types (or categories) in defining the notion of isomorphism is a byproduct of the fact that ur-elements were 
left out of set theory. 


7. APPENDIX: CONSISTENCY AND CHARACTERIZATION THEOREMS 


This section contains some results concerning the existence and nature of universes <U €> of 
symmetric sets. It is shown that the existence of a universe <U €> of symmetric sets is equivalent to the 
existence of a strongly inaccessible cardinal. It is also shown that a universe <U €> of symmetric sets is 
completely characterized by its "height" and "width". These results provide insight into two obvious 
questions. First, are the axioms for symmetric sets consistent? Second, to what extent are the axioms 
categorical, i.e. under what conditions are two universes isomorphic? 

7.1. Consistency and Strongly Inaccessible Cardinals 

There does not seem to be any satisfactory proof that there exists a universe of symmetric sets. 
However a simple condition can be given which is equivalent to the consistency of the axioms of symmetric 
set theory. In essence the consistency of the axioms depends purely on the consistency of axioms three 
through five (infinity, power set, and union). Axioms three through five characterize what is known as a 
strongly inaccessible cardinal. Thus the axioms of the theory of symmetric sets are consistent just in case there 
exists a strongly inaccessible cardinal. This result is of interest because strongly inaccessible cardinals have 
been studied in the context of Zermelo-Fraenkel set theory and it is generally believed that it is impossible to 
prove that strongly inaccessible cardinals do not exist. Thus there is strong evidence that the theory of 
symmetric sets will never be proven inconsistent. 

Definition: A set U will be said to have a strongly inaccessible cardinality just in case it meets 
the following conditions: 

1) U is uncountably infinite. 

2) If C is a subset of U smaller than U then the power set of C is also smaller than U. 

3) For any family F of subsets of U if F is smaller than U and every member of F is smaller 
than U then the union of all members of F is smaller than U. 

The main result of this section will be proven in two parts. First it will be shown that any set U 
which is larger than some strongly inaccessible cardinal can be expanded to a model <U €> of axioms one 
through six (the foundation axiom is initially ignored). It will then be shown that any model <U €> of axioms one 
through six contains a substructure <U' €> which also satisfies axiom seven (foundation). These two results 
will lead directly to the main result that a set U can be expanded to a universe <U €> just in case U is larger 
than some strongly inaccessible cardinal. 

For any set C let Sm(C) be the family of all non-empty subsets of C which are smaller than C. The 
following lemma concerning Sm(C) is a standard result of set theory and will be stated without proof: 




Lemma A.I: For any infinite set C, Sm(C) is die same size as C. 
The next lemma is a direct predecessor to the main result of this section. 
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Lemma A.2: Any set U which is larger than some strongly inaccessible cardinal can be 
/""N. expanded to a model <U €> of axioms one dirough six. 

Proof: To ensure that there will be a large number of points a subset P(U) of U is chosen such that 
both P(U) and U-P(U) are the same size as U (this can always be done for any infinite set U). Since 
U is larger than some strongly inaccessible cardinal we can choose some subset U' of U with a 
strongly inaccessible cardinality. Let Sm(U U') denote the family of all non-empty subsets of U 
which are smaller than U'. Since Sm(U U') is a subset of Sm(U), lemma A.1 implies that 
Sm(U U') can be no larger than U. On the other hand Sm(U U') contains all the singleton subsets 
of U and thus Sm(U U') is as large as U. Thus Sm(U U') has the same cardinality as U and there is 
a bijection f from U-P(U) to Sm(U U') such that each element x of U-P(U) represents some 
non-empty subset f(x) of U which is smaller than U' and every such subset has a unique such 
representation. The relation ∈ is now defined such that x∈y just in case y is in U-P(U) and x is 
in f(y). The resulting structure <U ∈> clearly satisfies axioms one and two (extensionality and 
strong comprehension). A subset of U is small in <U ∈> just in case it is smaller than U'. The 
definition of a set with strongly inaccessible cardinality now directly implies that <U ∈> satisfies 
axioms three through five (infinity, power set, and union). The fact that P(U) is as big as U implies 
that <U ∈> satisfies axiom six. 
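
The way the membership relation is read off from the map f can be illustrated on a small finite structure. The following is only a sketch: a finite set cannot meet the cardinality conditions of the lemma, so it shows nothing about axioms three through six, and the names `build_membership`, `members_of` and `is_extensional` are hypothetical helpers introduced here for illustration.

```python
def build_membership(universe, points, f):
    """Return the relation as a set of pairs (x, y) meaning "x is a member of y".

    `f` maps each non-point y to the non-empty subset of `universe` it represents,
    mirroring "x∈y just in case y is in U-P(U) and x is in f(y)".
    """
    non_points = universe - points
    assert set(f) == non_points, "f must be defined exactly on the non-points"
    return {(x, y) for y in non_points for x in f[y]}


def members_of(y, relation):
    """Collect the members of y under the given relation."""
    return frozenset(x for (x, z) in relation if z == y)


def is_extensional(universe, points, relation):
    """Check that points have no members and distinct non-points have distinct members."""
    non_points = universe - points
    if any(members_of(p, relation) for p in points):
        return False
    return len({members_of(y, relation) for y in non_points}) == len(non_points)


# Toy data (an assumption of this sketch): three points and three represented subsets.
universe = {0, 1, 2, 3, 4, 5}
points = {0, 1, 2}
f = {3: {0}, 4: {0, 1}, 5: {3, 4}}

relation = build_membership(universe, points, f)
```

Because f is injective in this toy example, `is_extensional(universe, points, relation)` holds, which is the finite counterpart of the observation that the construction satisfies extensionality.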

It can also be shown that any model of axioms one through six can be used to generate a model which also 
satisfies foundation. 


Lemma A.3: Any model <U ∈> of axioms one through six contains a substructure which is a 
model of axioms one through seven. 

Proof: An element x of U will be called a well founded element if there are no infinitely 
decreasing ∈ chains containing it. Let U' be the subset of U consisting of the well founded 
elements of U. The substructure <U' ∈> clearly satisfies the foundation axiom so it is sufficient to 
show that it also satisfies axioms one through six. Note that every point is well founded so P(U) is 
contained in U' and thus by the point comprehension axiom U' is a large subset of U. If x is a well 
founded element of U then every y such that y∈x is also well founded and thus any well founded 
element represents the same subset of U whether it is viewed as an element of <U ∈> or as an 
element of <U' ∈>. Since no two elements of U represent the same set under <U ∈> no two 
elements of U' represent the same set under <U' ∈> and so <U' ∈> satisfies extensionality. To show 
that <U' ∈> satisfies the strong comprehension axiom let C be any subset of U' which is small with 
respect to <U ∈>, i.e. there is an x in U which represents C. Since every member of C is well 
founded x must also be well founded and thus x is in U' and thus C is represented in <U' ∈>. Thus 
every subset of U' which is small with respect to <U ∈> is represented in <U' ∈>. On the other 
hand no set which is large in <U ∈> can be represented in <U' ∈>. Thus a subset of U' is 
represented in <U' ∈> just in case it is small with respect to <U ∈>. The fact that U' is large and that 
a subset of U' is small in <U' ∈> just in case it is small in <U ∈> immediately implies that <U' ∈> 
satisfies the axioms of infinity, power set and union. Since U' contains P(U) the number of points 
in U' is large so <U' ∈> also satisfies the point comprehension axiom. 
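
The passage from a structure to its well founded part can be sketched for a finite membership structure. This is an illustration under stated assumptions, not the construction of the proof: in a finite structure an infinitely decreasing chain exists exactly when a membership cycle can be reached, so the well founded elements can be collected by repeatedly admitting every element all of whose members have already been admitted. The function name `well_founded_part` and the toy data are hypothetical.

```python
def well_founded_part(universe, members):
    """Return the elements of a finite structure with no infinitely decreasing chain.

    `members[y]` is the set of x with x∈y; elements absent from `members` are points.
    """
    well_founded = set()
    changed = True
    while changed:
        changed = False
        for y in universe:
            # Admit y once every member of y is already known to be well founded.
            if y not in well_founded and members.get(y, set()) <= well_founded:
                well_founded.add(y)
                changed = True
    return well_founded


# Toy structure (an assumption of this sketch): "c" is a member of itself,
# and "d" has "c" as a member, so neither is well founded.
universe = {"p", "q", "a", "b", "c", "d"}
members = {"a": {"p", "q"}, "b": {"a", "p"}, "c": {"c"}, "d": {"c", "p"}}

u_prime = well_founded_part(universe, members)

# Every member of a well founded element is itself well founded, so each
# element of u_prime represents the same subset inside the substructure.
closed_under_members = all(members.get(x, set()) <= u_prime for x in u_prime)
```

In this example the points "p" and "q" are admitted first, then "a" and "b", while "c" and "d" are never admitted.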

Lemmas A.2 and A.3 lead directly to the main result of this section. 

Theorem A.4: A set U can be expanded to a model <U ∈> of the axioms of symmetric set 
theory just in case U is larger than some strongly inaccessible cardinal. 

Proof: If U is larger than some strongly inaccessible cardinal then by lemma A.2 it can be 
expanded to a model <U ∈> of axioms one through six such that P(U) is the same size as U. By 
lemma A.3 there is a substructure <U' ∈> of <U ∈> which contains P(U) and which satisfies all of 
the axioms of symmetric set theory. But since U' has the same cardinality as U it is also possible to 
directly extend U to a model <U ∈> of all of the axioms. 

On the other hand if U can be expanded to a model <U ∈> of axioms two through five (strong 
comprehension, infinity, power set, and union) then it is easily shown that U must be larger than 
some strongly inaccessible cardinal. 


7.2. The Height and Width of a Universe 


A universe <U ∈> of symmetric sets is characterized (up to isomorphism) by two "numbers", its 
height and its width. The height and width of a universe <U ∈> are defined as follows: 

Definition: The width of a universe <U ∈> is defined to be the cardinality of its set of points 
P(U). A subset C of U will be said to be minimally large in <U ∈> if no subset of C which is 
smaller than C is large in <U ∈>. The height of a universe <U ∈> is defined to be the 
cardinality of any minimally large subset of U. 

The following lemma can be demonstrated directly from the comprehension axioms. 

Lemma A.5: The height of any universe <U ∈> is a strongly inaccessible cardinal. 

The constructions used in the consistency theorems of the previous section show that for any 
strongly inaccessible cardinal there is a universe with that height. Since P(U) is required to be a large set the 
width of a universe must always be at least as large as its height. The constructions used in the consistency 
theorems further show that the width of a universe can be any cardinality larger than its height. 

A universe can be thought of as a rectangle which is no higher than its width. The points of the 
universe should be thought of as lying along the bottom edge of this rectangle. The main theorem of this 
section can be proven directly. 


Theorem A.6: Any two universes of symmetric sets with the same height and width are 
isomorphic. 

Proof: Let the universes be <U₁ ∈₁> and <U₂ ∈₂>. Since both universes have the same height a 
set is small with respect to one universe just in case it is small with respect to the other universe. 
Since P(U₁) and P(U₂) are the same size there exists a bijection ρ from P(U₁) to P(U₂). The 
function ρ can be extended to a function σ from all of U₁ into U₂ via the following inductive 
definition: 

σ(p) = ρ(p) for any point p in P(U₁) 

σ(x) = the representation for {σ(y): y ∈₁ x} for x not a point 

The set on the right side of the second equation is guaranteed to have a representation in U₂ 
because it is no larger than the set represented by x. It is easy to show by induction on x under 
∈₁⁺ that σ(y) ∈₂ σ(x) just in case y ∈₁ x. To show that σ is a bijection it is sufficient to construct an 
inverse function σ⁻¹ such that for any x in U₁ σ⁻¹(σ(x)) equals x and for any y in U₂ σ(σ⁻¹(y)) 
equals y. The inverse function σ⁻¹ is defined by extending ρ⁻¹ from P(U₂) to all of U₂ via a relation 
analogous to that above. The two conditions relating σ and σ⁻¹ can then be proven by induction on 
∈₁⁺ and ∈₂⁺ respectively. 
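
The inductive extension of a bijection between the points to a map between the universes can be sketched on two small finite well founded structures. This is an illustration only: in the finite setting nothing guarantees that the required representation exists in the second structure, so the sketch simply fails with a `KeyError` when it does not. The names `extend_to_isomorphism`, `preserves_membership`, `rho` and `sigma`, and the toy data, are assumptions introduced here.

```python
def extend_to_isomorphism(members1, members2, rho):
    """Extend a bijection `rho` between the points to a map `sigma` on every element.

    `members1[x]` and `members2[y]` give the member sets of the non-points of the
    first and second structure. The recursion terminates because the first
    structure is assumed to be well founded.
    """
    # Look up the element of the second structure that represents a given set.
    represent2 = {frozenset(ms): y for y, ms in members2.items()}
    sigma = dict(rho)  # sigma agrees with rho on the points

    def image(x):
        if x not in sigma:
            # sigma(x) is the representation of the set of images of the members of x.
            target = frozenset(image(y) for y in members1[x])
            sigma[x] = represent2[target]  # KeyError if no representation exists
        return sigma[x]

    for x in members1:
        image(x)
    return sigma


def preserves_membership(members1, members2, sigma):
    """Check that y is a member of x exactly when sigma(y) is a member of sigma(x)."""
    return all(
        (y in members1.get(x, ())) == (sigma[y] in members2.get(sigma[x], ()))
        for x in sigma
        for y in sigma
    )


# Toy structures (assumptions of this sketch) with two points each.
members1 = {"a1": {"p1", "q1"}, "b1": {"a1", "p1"}}
members2 = {"a2": {"p2", "q2"}, "b2": {"a2", "p2"}}
rho = {"p1": "p2", "q1": "q2"}

sigma = extend_to_isomorphism(members1, members2, rho)
```

Here `sigma` sends "a1" to "a2" and "b1" to "b2", and `preserves_membership(members1, members2, sigma)` confirms the membership condition stated in the proof for this toy case.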

There are a few other results which help to characterize a universe <U ∈>. These results will be stated 
briefly without proof. First it can be shown that in general the size of U equals the size of P(U) (which is at 
least as large as the height of U). Second the notion of "rank" used in ZF set theory can also be defined for 
symmetric sets. The details of this definition are not important but one result concerning a characterization of 
small sets will be mentioned. For any subset C of U let P(C) be the union over x in C of P(x). It turns out 
that a subset C of U is small just in case P(C) is small and the rank of C is less than the height of the universe. 



